Picture for Jifeng Dai

Jifeng Dai

AnyScene: Towards Highly Controllable Driving Scene Generation at Anywhere and Beyond

Add code
May 25, 2026
Viaarxiv icon

Action Emergence from Streaming Intent

Add code
May 14, 2026
Viaarxiv icon

MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving

Add code
May 14, 2026
Viaarxiv icon

Driving Intents Amplify Planning-Oriented Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

Add code
Feb 26, 2026
Viaarxiv icon

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Add code
Nov 18, 2025
Viaarxiv icon

GenExam: A Multidisciplinary Text-to-Image Exam

Add code
Sep 17, 2025
Figure 1 for GenExam: A Multidisciplinary Text-to-Image Exam
Figure 2 for GenExam: A Multidisciplinary Text-to-Image Exam
Figure 3 for GenExam: A Multidisciplinary Text-to-Image Exam
Figure 4 for GenExam: A Multidisciplinary Text-to-Image Exam
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Figure 1 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 2 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 3 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Figure 4 for InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Viaarxiv icon

Spatial Frequency Modulation for Semantic Segmentation

Add code
Jul 16, 2025
Figure 1 for Spatial Frequency Modulation for Semantic Segmentation
Figure 2 for Spatial Frequency Modulation for Semantic Segmentation
Figure 3 for Spatial Frequency Modulation for Semantic Segmentation
Figure 4 for Spatial Frequency Modulation for Semantic Segmentation
Viaarxiv icon

CoMemo: LVLMs Need Image Context with Image Memory

Add code
Jun 06, 2025
Figure 1 for CoMemo: LVLMs Need Image Context with Image Memory
Figure 2 for CoMemo: LVLMs Need Image Context with Image Memory
Figure 3 for CoMemo: LVLMs Need Image Context with Image Memory
Figure 4 for CoMemo: LVLMs Need Image Context with Image Memory
Viaarxiv icon